Intelligent Memory : An Architecture forLock - Free Synchronization
نویسندگان
چکیده
This paper presents intelligent memory, a new memory architecture capable of providing e cient lock-free synchronization. In the intelligent memory, a sequence of operations on a shared object associated with that memory module can be processed without any intervention so that an environment for the synchronization can be provided by executing a critical section itself in that memory module. For this, we present a memory architecture for intelligent memory having minimal instruction set and develop a programming model, called Critical Section Procedure (CSP), which consists of a shared data structure and operations on it. Intelligent memory is intended to eliminate waste of processing time such as busy waiting in spin lock and retry due to process contention in existing lock-free synchronization schemes. Simulation results show that intelligent memory provides better throughput compared with a spin lock and the existing lock-free synchronization.
منابع مشابه
Towards Guided Data Forwarding using Intelligent Memory
Memory access is one of today’s performance bottlenecks, as memory latencies are lagging behind the access times needed by advances in processor speeds. In this paper, we propose an intelligent memory data forwarding architecture where the data of load instructions are forwarded from the memory side in advance of their use in the main processor. We propose a solution combining two new approache...
متن کاملAn Algorithm for Register-Synchronized Precomputation In Intelligent Memory Systems
This paper presents a novel compiler algorithm for selecting program slices that prefetch load values concurrently with program execution. The algorithm is evaluated in the context of an intelligent memory system. The architecture consists of a main processor and a simple memory processor. The intelligent memory system pre-executes program slices and forwards values of critical loads to the mai...
متن کاملMultithreading for synchronization tolerance in matrix factorization
Physical constraints such as power, leakage and pin bandwidth are currently driving the HPC industry to produce systems with unprecedented levels of concurrency. In these parallel systems, synchronization and memory operations are becoming considerably more expensive than before. In this work we study parallel matrix factorization codes and conclude that they need to be re-engineered to avoid u...
متن کاملA Context-aware Architecture for Mental Model Sharing through Semantic Movement in Intelligent Agents
Recent studies in multi-agent systems are paying increasingly more attention to the paradigm of designing intelligent agents with human inspired concepts. One of the main cognitive concepts driving the core of many recent approaches in multi agent systems is shared mental models. In this paper, we propose an architecture for sharing mental models based on a new concept called semantic movement....
متن کاملEfficient Fine Grained Synchronization Support Using Full/Empty Tagged Shared Memory and Cache Coherency
Performance results of machines with fine-grain synchronization on individual lock-free data items (e.g., words), such as the MIT Alewife multiprocessor, illustrate the benefits of supporting fine-grain synchronization. The performance benefits are primarily the result of allowing a dataflow style of computation in programming models, and maximizing the exposed parallelism by minimizing the pos...
متن کامل